IMPROVED MANAGEMENT OF AND ACCESS TO INFORMATION
AND OTHER MATERIAL VIA THE WORLD WIDE WEB 

Field of invention 

The present invention relates to improved management of and access
to information, images and other material via the World Wide Web Internet
service. 

Background of invention

A 'network' of computers can be any number of computers that are 
able to exchange information with one another. The computers may be 
arranged in any configuration and may be located in the same room or in 
different countries, so long as there is some way to connect them
together (for example, by telephone lines or other communication systems)
so they can exchange information. Just as computers may be connected
together to make up a network, networks may also be connected together
through tools known as bridges and gateways. These tools allow a computer
in one network to exchange information with a computer in another
network.

The Internet is a network of networks having no single owner or 
controller and including large and small, public and private networks, 
and in which any connected computer running Internet Protocol software
is, subject to security controls, capable of exchanging information with 
any other computer which is also connected to the Internet. This 
composite collection of networks which have agreed to connect to one 
another relies on no single transmission medium (for example,
bidirectional communication can occur via satellite links, fiberoptic
trunk lines, telephone lines, cable TV wires and local radio links).

The World Wide Web Internet service ('Web' hereafter) is a wide 
area information retrieval facility which provides access to an enormous 
quantity of network-accessible information. Information about the World
Wide Web can be found in "Spinning the Web" by Andrew Ford (International
Thomson Publishing, London 1995) and "The World Wide Web Unleashed" by
John December and Neil Randall (SAMS Publishing, Indianapolis 1994). Use
of the Web is growing at an explosive rate because of its combination of 
flexibility, portability and ease-of-use, coupled with interactive
multimedia presentation capabilities. The Web allows any computer
connected to the Internet and having the appropriate software and
hardware configuration to retrieve any document that has been made
publicly available anywhere on the Internet. The retrievable documents on
the Web include 'HyperMedia' documents - i.e. documents which may be text
documents or other forms of media such as sounds and images and which may
have links ('hyperlinks' - see below) to other documents. The format of
such documents on the Web is a standard format in HTML (HyperText Markup
Language), such that a document created on one operating system and
hardware platform can be read by a user on any other platform that has an
appropriate Web Browser (see below). HTML is associated with a specific
communication protocol known as HyperText Transfer Protocol (http).
Images may be stored in separate graphics files, for example in standard 
GIF or JPEG format, which are referenced in the HTML text for retrieval 
with the HTML text. 

Users access this information using a 'Web Browser', (also referred 
to as a 'Web Client Browser'), which is software installed on the user's 
computer and having facilities for serving or retrieving documents from a
Web Server via the Internet. Currently available Web Browsers include
WebExplorer (TM) from IBM Corporation, Netscape Navigator from Netscape
Communications Corporation, Internet Explorer from Microsoft Corporation, 
and Mosaic from NCSA. Such Browsers understand HTML and other Web 
standard formats and can display or output files correctly in these 
formats. The Web is structured as pages or files which each have a 
particular Universal Resource Locator (or URL). The URL is a reference
which denotes, amongst other things, both the server machine and the
particular file or page on that machine. A user can type in particular
URLs or jump from one page to an associated page by means of 'hyperlinks'
- that is, a word or symbol on a page can be associated with a URL for
another page which is selectable to cause the Browser to send a request
which retrieves, and then to display, the relevant page. The preferred
user interface for such Browser selection is the graphical
'point-and-click' interface (i.e. links are selected by moving a cursor to a
particular word or symbol on display and then pressing a mouse button).
The words, images and symbols having associated hyperlinks are
identifiable by a user as "hot spots" (for example, the relevant text may
be highlighted or underlined, or the cursor may change its appearance as
it passes over the hot spots). There may be many pages resident on a
single server, and associated hyperlinked pages may be located on 
different servers. 



Web pages are thus well known to be identifiable through URLs, such 
as http://www.pc.ibm.com/data.htm. This example illustrates three
components of the URL: "http" identifies the protocol to be used by a Web
client browser for access to the page; "www.pc.ibm.com" identifies the
target computer (this computer name is converted to its numeric-form
Internet address); and "data.htm" identifies the page to be accessed on
that computer. More complex examples having additional parameters are
also possible, such that specific data may be passed from the client
computer to the server computer in a URL specification.
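
The decomposition described above can be illustrated with a short sketch
in Python, using the standard urllib.parse module. The function name
describe_url and the second URL (with its 'topic' parameter) are
assumptions made for illustration only and do not appear in this
specification.

```python
from urllib.parse import urlsplit, parse_qs


def describe_url(url):
    """Split a URL into the components named in the text above."""
    parts = urlsplit(url)
    return {
        "protocol": parts.scheme,              # e.g. "http"
        "computer": parts.hostname,            # name later resolved to a numeric address
        "page": parts.path.lstrip("/"),        # file or page on that computer
        "parameters": parse_qs(parts.query),   # optional data passed to the server
    }


print(describe_url("http://www.pc.ibm.com/data.htm"))
# Hypothetical URL carrying an additional parameter:
print(describe_url("http://www.pc.ibm.com/lookup.htm?topic=printers"))
```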

In order to facilitate easy return to a particular web page at a 
later time without having to retrace the original steps which led to 
discovery of the target Web page, URLs and associated descriptors (which
by default are taken from the web page, but are editable) can be saved as
"bookmarks" at the client computer. Such a scheme is shown in Figure 1,
in which URLs and descriptors for specific web pages 10 are stored as
bookmarks 20 at a client computer system 30. User selection of
such a bookmark at the client system initiates a request for downloading
to the client system from a server system 40 of the respective Web page
10. The user can then select hyperlinks 50 within a downloaded Web page
to access other web pages of interest (which may be on other server
computers as shown).

This scheme is commonplace and has proven extremely successful. 
However, it gives rise to a number of problems which affect both clients 
and servers. 



From a client perspective:

o a URL stored in a bookmark may not be valid when re-used
(e.g. the Web page has been deleted prior to the attempted
re-access); in this case the access fails and the user
receives a generic failure message. It is not possible
to supply the user with reasons for the Web page deletion or
to provide alternative destinations of possible interest.

o a URL stored in a bookmark may identify a busy Web page; at
the time of re-use access may not be possible because of the
demands of other users. Unfortunately it is not possible to
re-direct the browser to an alternative Web page (which could
have been set up to contain identical information).

From a server perspective:

o re-organisation of a Web site is difficult because of the
desire to preserve the integrity of URLs previously used and
stored as bookmarks in unknown clients.

o similarly, the embedded URLs in web pages mean that the
movement of a web page requires hyperlinks in other pages to
be updated to maintain the validity of the reference.

o lists of alternative URLs need to be provided on web pages to
cope with expected demand for the content; this is especially
true when the URLs point to files to be downloaded (URLs of
the form "ftp://...", where the prefix now indicates the use
of the file transfer protocol) because each user consumes
considerable resources. Such lists are tiresome for users,
take valuable page space, and do not provide effective load
balancing across the multiple servers.

Summary of invention 

In a first aspect of the invention there is provided an access
mechanism for accessing material via the World Wide Web. Web pages
typically include hyperlinks for accessing associated Web pages stored at
the same or at different web server computers. According to the
invention, at least some of these hyperlinks comprise links to one or
more directories and the directories store URLs for accessing particular
Web pages. The access mechanism includes access logic, responsive to user
selection at a web client system of a hyperlink comprising a link to one
of the directories, for retrieving from said directory a particular one
or plurality of said stored URLs and for accessing at least one of said
particular web pages using said retrieved URL.

Directories are repositories of objects (i.e. directory entries are 
known as objects) which are organised to support locating of those
objects. Objects comprise one or more attribute-value pairs. For example,
a 'person' object may have a 'name' attribute with value 'Steven Jones',
and a 'telephone number' attribute with value '0171-815000'. Typically a
directory is an hierarchical arrangement of named objects, where each
object comprises one or more attribute-values relating to that object.
Directory look-up operations use structured queries which typically
identify (1) a named object in the hierarchy for the start of a search,
(2) a depth of search condition, (3) a set of attribute-value assertions
to be satisfied by candidate objects, and (4) a set of attributes to be
returned for the candidate objects. The type of directory objects (and
the type of query) will vary for different directories. For example, an
application may perform an attribute-value type query on an Access
Control Server via a defined protocol such as LDAP (described below under
'Detailed Description of Preferred Embodiments') to obtain a directory
object comprising a person's access rights attributes. Another
attribute-value query on a Network Name Server may seek network addresses for
servers, printers and other devices.

To enable locating of particular items of interest within the 
enormous quantity of information and other material (such as audio and 
video) which is available for public access from Internet Web pages, it
is known to organise web pages and files using directories and to access
the directory objects using structured user-input directory-search
queries. However, it is not known to provide URLs within Web page
hyperlinks which are directory-reference URLs and to provide access logic
for automatically retrieving Web page URLs stored in a directory when a
user selects a hyperlink including a directory-reference URL.

The present invention involves storing of Web page URLs as 
attribute-values of certain directory objects and providing Web page
hyperlinks to those directory objects together with access logic 
responsive to the hyperlinks for retrieving the URLs for use by a client. 
This indirect access to Web pages via hyperlinks to directories has 
significant advantages for Web page organisation and facilitates more 
flexible methods of web page access than the known use of hyperlinks 
which include URLs pointing directly to the target Web pages. Use of the
present invention within systems and methods in which access to Web pages
and files is required gives a number of important advantages: 

o web pages within a site, or across sites, can be re-organised
without invalidating directory-reference links stored in
bookmarks; such links can be responded to intelligently even
when the original target web page has been deleted, or the
client can be furnished with new links to content.

o an 'intelligent' directory can furnish different indirect 
references to achieve effective load balancing. 

o indirect access through a directory provides an additional 
level of access control, and this can be used to make the 
reference supplied from the directory object dependent upon 
the identity of the client. 

o since the directory objects form an inventory of Web pages, 
Web page management is facilitated. 

o the directory can store index data in association with Web
page URLs, and access to the directory through a Web Browser
supporting LDAP allows these index terms to be searched; this 
provides much richer access paths to required material, 
complementing the known hyperlink mechanism. 

The access logic for retrieving URLs and accessing Web pages
preferably comprises an applet for execution on the web client system
which is adapted to interact with a Web Browser on the client system and
with a directory server. In the preferred embodiment of the invention,
when a Web page having a directory-reference hyperlink is downloaded to a
client system from a web server system and the directory-reference
hyperlink is selected, a determination is made of whether the required
applet is already available on the client system. If not, then the applet
is requested from the directory server and is stored thereafter on the
client system for use in future interactions with the directory server.
Such applets could equally be stored at Web server computers together
with Web pages which include directory-reference hyperlinks.

The applet is executed on the client system when a directory-reference
hyperlink is selected, the applet then initiating the sending
of a request from the client system to a directory server to obtain the
URL (or URLs) associated with the particular hyperlink. The directory
server returns the URLs to the applet at the client system which passes
at least a first one of the URLs to a Web Browser running at the client
system. The web Browser then uses this URL to access the required web
page as is known in the art. In this preferred embodiment, the Web
Browser may be a conventional Browser operating in the normal way except
for the performance of an intermediate directory access process prior to
a Browser request being sent to the server which holds the required Web
page. This intermediate process preferably does not require any
additional user interactions and is 'invisible' to the user - a Web page
or file is displayed to a user at the client system automatically
following user selection of a directory-reference hyperlink.
Alternatively, in embodiments in which a plurality of URLs can be
retrieved from the directory and passed to the Web Browser by the applet,
the user may be required to select from a list of obtained URLs.
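
The client-side flow just described can be sketched in Python as follows.
This is a minimal illustration under stated assumptions, not the applet
itself: the class name ClientAccessLogic and the three callables passed to
it (download_applet, query_directory, open_in_browser) are hypothetical
stand-ins for the applet download, the directory request and the Web
Browser respectively.

```python
class ClientAccessLogic:
    """Hypothetical stand-in for the applet and its local availability check."""

    def __init__(self, download_applet, query_directory, open_in_browser):
        self._download_applet = download_applet    # fetches the access logic once
        self._query_directory = query_directory    # directory-reference URL -> list of URLs
        self._open_in_browser = open_in_browser    # conventional Browser page access
        self._applet_installed = False
        self.downloads = 0

    def on_hyperlink_selected(self, url):
        # Ordinary hyperlinks are handled by the Browser in the normal way.
        if not url.lower().startswith("ldap://"):
            return self._open_in_browser(url)
        # Directory-reference hyperlink: make sure the access logic is available,
        # downloading it only the first time it is needed.
        if not self._applet_installed:
            self._download_applet()
            self._applet_installed = True
            self.downloads += 1
        urls = self._query_directory(url)
        if not urls:
            raise LookupError("directory returned no URLs for " + url)
        # Pass at least the first retrieved URL on to the Browser.
        return self._open_in_browser(urls[0])
```

Keeping the directory step inside a single method reflects the point made
above that the intermediate access is 'invisible' to the user: the caller
sees only a selected hyperlink and a displayed page.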

In an alternative embodiment of the invention, the access logic for
retrieving URLs and accessing web pages is implemented as an integral
feature of the program code of a modified web Browser rather than using a
separate downloaded applet. That is, the web Browser itself responds to
directory-reference hyperlinks by sending a request to a directory
server. The Web Browser may also be responsible for automatic selection
from a plurality of retrieved URLs. The above may be referred to as
'client-side' implementations of the invention.

In alternative embodiments of the invention, the operations which
are performed at a Web client system may be substantially unchanged from
prior art Web browsing operations, but with retrieval of URLs from a
directory in response to directory-reference hyperlinks being performed
at the server. In one such embodiment, a server computer obtains
requested Web pages from storage in response to a client request as is
known in the art. The server computer is programmed to scan the retrieved
Web pages for embedded directory-reference URLs. If a directory-reference
URL is found, the server computer then contacts the directory to retrieve
the specified directory object. The server processes the directory object
to obtain a web page URL (an attribute-value of the object) and modifies
the Web page to include this URL before returning the web page to the
client. This may be referred to as a 'server-side' implementation of the
invention.



The scanning of Web pages to detect directory-reference URLs may,
alternatively, be made conditional on the particular content of the web
page, or on a logic prediction of the likelihood of user interaction with
a directory-reference URL, or on an event such as the user at the client
end having indicated a desire to interact with the directory-reference
URL.

According to another 'server-side' embodiment of the invention,
when a user selects a directory-reference hyperlink within a Web page
(for example from a bookmark at the client system), a Web Browser request
is invoked in the standard way except that the request uses the LDAP
protocol and is sent to a particular server computer specified in the
request, on which server the referenced directory is stored. The
directory server computer responds to a received request by retrieving
from the directory database the appropriate URL or URLs and returning one 
of these to the web Browser. The web Browser then sends a request to the 
server computer identified by the returned URL. It is not a preferred 
feature of this embodiment for multiple URLs to be returned to the 
client, but rather a single URL is returned to the client which then 
initiates access. Since the client then initiates Web page access using a 
single URL, the protocol requested by the client is consistent with the 
protocol returned to the client. 

A further server-side embodiment of the invention has more of the 
functions which are provided by the client computer system in the
above-described embodiments implemented instead in a directory server system.
Directory-reference hyperlinks invoke client requests to a directory
server. The directory server both obtains one or more relevant URLs and
sends requests to one or more relevant servers using the obtained URLs.
Such requests received from the directory server by a target server
computer specify the directory server as the initial target destination
for responses from the servers, rather than responses being sent directly
to the web Browser at the client system. This enables the directory 
server to perform selection between responses where a plurality of 
associated responses are received at the directory server within a preset 
time period. 

The invention according to one 'server-side' embodiment of the 
invention provides a method, implemented by a Web server computer which
is adapted to access one or more directories storing directory objects
whose attributes include web page URLs, of processing web pages retrieved
by a Web server computer in response to a request from a web client
computer. The method comprises:

accessing a storage means in response to a request from a web 
client computer and retrieving a requested Web page; 

scanning the retrieved page for directory-reference URLs specifying
particular directory objects;

on detection of a directory-reference URL, requesting the specified
directory object from the referenced directory;

on receipt of the requested directory object, processing the
directory object to obtain a web page URL and incorporating the Web page
URL within the retrieved web page;

returning the processed web page to the Web client computer. 
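
The scanning and incorporating steps above can be sketched in Python as
follows. This is an illustrative sketch only. It assumes that
directory-reference URLs appear as double-quoted href values beginning
with "ldap://", and that the directory lookup is supplied as a callable
(here named lookup_url, a hypothetical helper) returning a web page URL
or None.

```python
import re

# Assumption: directory-reference links are written as href="ldap://...".
HREF_PATTERN = re.compile(r'href\s*=\s*"(ldap://[^"]+)"', re.IGNORECASE)


def resolve_directory_references(page_html, lookup_url):
    """Replace each directory-reference URL in a page with a web page URL."""

    def substitute(match):
        resolved = lookup_url(match.group(1))   # request the directory object's URL
        if resolved is None:
            return match.group(0)               # nothing found: leave the link as it was
        return 'href="' + resolved + '"'        # incorporate the web page URL

    return HREF_PATTERN.sub(substitute, page_html)


# Hypothetical directory contents and page, for illustration only.
directory = {"ldap://dir.example/news": "http://www1.example/news.htm"}
page = '<a href="ldap://dir.example/news">News</a> <a href="http://www.ibm.com/">IBM</a>'
print(resolve_directory_references(page, directory.get))
```

Ordinary hyperlinks pass through unchanged, so the client receives a page
it can handle with prior art browsing operations.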

According to one 'client-side' embodiment of the invention, there
is provided a method of accessing Web pages, implemented by a web client
system which is adapted to access one or more directories storing
directory objects whose attributes include Web page URLs, wherein Web
pages include hyperlinks for accessing associated web pages stored at Web
server computers and at least some of said hyperlinks comprise links to
said one or more directories, the method comprising:

responsive to user selection at a web client system of a hyperlink 
comprising a link to one of said directories and specifying a particular 
directory object, accessing said directory to retrieve the particular 
directory object; 

processing the directory object to obtain a Web page URL from the 
directory object; and 

accessing the Web page using the obtained URL. 

The invention according to a further aspect provides a data 
processing system comprising: 




at least one client computer having a Web Browser installed
thereon;

at least one server computer storing web pages identified by Web
page URLs, certain of said web pages including hyperlinks for accessing 
associated Web pages; and 



a directory comprising at least one server computer storing a
directory database in which directory objects have Web page URLs as
attributes;

wherein at least some of said hyperlinks comprise links to the
directory specifying particular directory objects and wherein the system
includes access logic, responsive to user selection at a client computer
of a hyperlink comprising a link to the directory, for retrieving from
the directory a particular directory object, processing the directory
object to obtain a Web page URL, and accessing the Web page identified by
the URL.



The present invention is thus implementable in a client data
processing system or in a server data processing system of a distributed 
data processing network. 

As well as supporting provision to a user at a client system, in 
response to invoking of a directory-reference hyperlink, of individual
Web pages that were stored at Web servers, the present invention also
supports 'cgi-bin' type requests which invoke (and provide data to) the
cgi-bin program. Material presented to the user may be a dynamically
generated file (for example, combining information from a plurality of
different Web pages).



Brief description of drawings 

Preferred embodiments of the present invention will now be 
described in more detail, by way of example, with reference to the 
accompanying drawings in which: 

Figure 1 is a schematic representation of the prior art interaction 
between a client system and a server system in which the client requests 
access to required Web pages directly using URLs (which may be stored in 
bookmarks at the client). Hyperlinks between web pages at the server
enable hopping between associated pages;

Figure 2 is a schematic representation of a network of client and
server computers in which is implemented a client-side embodiment of the
present invention;

Figure 3 is a flow diagram representation of web page access via a 
directory according to an embodiment of the present invention; 

Figure 4 is a schematic representation of a client-server
arrangement of computers in which the present invention may be
implemented according to a server-side embodiment; and

Figure 5 is a flow diagram showing the steps of web page access 
according to a server-side embodiment of the invention. 

Detailed description of preferred embodiments 

A client-server data processing network in which the present
invention is implemented is represented schematically in Figure 2. A Web
Browser program 100 is installed on a client computer system 110 and
accesses web pages by sending requests to a web server computer 120
hosting Web pages in its local storage 130. The Web server is connected 
to the client system via the Internet. Only two client computer systems 
and two server computer systems are shown for simplicity. 

As is known in the prior art, a web server computer name is 
specified at the client system, by a user at the client system entering a 
URL for a Web page of interest (for example, "http://www.ibm.com/News/"
identifies a specific computer "www" within the organisation identified
by "ibm" within Internet class "com" (commercial)). Alternatively the
user selects a bookmark stored at the client Browser which includes the
URL.

The computer name is then converted by a name server facility into
a numeric Internet address (of the form "29.5.19.66"). All TCP/IP
based applications are aware of the name server facility and
automatically go to a designated name server computer to request
resolution of a computer name into an Internet address before attempting
to make a connection to another computer in the network. The conversion
of specified names to addresses involves a table lookup of a list
maintained by the name server computer. The storage of computer
name-address lists, and the function of performing the conversion, is
distributed throughout the Internet. That is, a client server is
configured with information regarding how to access its local name server
computer, and if an address cannot be determined locally then the local
name server computer accesses another name server computer within the
hierarchy until the address is obtained. The Internet address is then
provided to the client computer which can then use this address to send a
request to the relevant server computer.
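
The distributed table lookup described above can be modelled with a small
Python sketch. It is a simplified illustration, not a description of any
real name server software: the NameServer class, the computer names and
the addresses (taken from a range reserved for documentation) are all
assumptions.

```python
class NameServer:
    """Toy name server: a local name-address table plus an optional parent."""

    def __init__(self, table, parent=None):
        self.table = dict(table)   # computer name -> numeric Internet address
        self.parent = parent       # next name server to ask in the hierarchy

    def resolve(self, name):
        server = self
        while server is not None:
            address = server.table.get(name)   # table lookup at this server
            if address is not None:
                return address
            server = server.parent             # not known locally: ask further up
        raise LookupError("no address known for " + name)


# Hypothetical hierarchy: a local name server backed by one further server.
upstream = NameServer({"www.remote.example": "192.0.2.20"})
local = NameServer({"www.local.example": "192.0.2.10"}, parent=upstream)

print(local.resolve("www.local.example"))    # answered locally
print(local.resolve("www.remote.example"))   # answered by the other name server
```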

The server computer responds to a received request by retrieving
from its storage the specific file or page identified in the request and
then sending it back to the client system. The Web Browser then manages
displaying of the contents of the file at the client system.

The present invention provides apparatus and a method for accessing
Web pages and files of interest wherein a client system request may
differ from the specification of a particular target computer and file in
the conventional way described above. The present invention makes use of
the wide adoption of directories for facilitating efficient access to web
pages.

A specific protocol has been proposed to provide access to the
X.500 Directory without incurring the resource requirements of and
without the complexity of the Directory Access Protocol (DAP). This
protocol, the Lightweight Directory Access Protocol (LDAP), has been
widely adopted for Internet applications and in particular for
facilitating efficient access to the enormous quantity of data which is
available in web pages and organised by directories. LDAP is described by
Yeong et al in the Request For Comments document "Lightweight Directory
Access Protocol", Internet RFC 1777, Performance Systems International,
University of Michigan, ISODE Consortium, March 1995. The general
protocol model of LDAP comprises clients performing protocol operations
against servers. This is accomplished by a client transmitting a protocol
request describing the operation to be performed to a server, which is
then responsible for performing the necessary operations on the 
Directory. Using LDAP, URL references for accessing objects within a
directory are of the form "ldap://...". The resolution of a server
computer identification within an LDAP URL into a numeric address is
performed by a name server facility in the same manner described above in
relation to standard Internet addressing.

A directory lookup process then involves stepping through a
hierarchy of directory objects in accordance with the particular client
request. For example, a named directory object specified in the request
using its Distinguished Name 'uk.ibm.hursley.printers' is located by
stepping down the directory hierarchy from the root to 'uk', then to
'ibm', then 'hursley', then 'printers'.



root
 |
 uk
 |
 ibm
 |
 hursley
 |
 printers

The request also specifies a depth of search condition, an
attribute-value assertion and what attributes are to be returned to the
requester. LDAP supports three depth alternatives: 0 (i.e. only the
object itself), 1 (only the object's immediate children, and not itself),
or n (the complete subtree of the hierarchy including the object itself
and everything underneath). In the present example, the depth condition
may be 1 (for individual printer objects). An attribute-value assertion
may be, for example, 'Paper=A3' (i.e. the capability of printing on A3
paper), and the attribute to be returned may be the locations of the
printers meeting the criteria. Upon completion of these operations, the
server returns a response containing any results or errors to the
requesting client.
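
The four parts of such a request (start object, depth, assertions and
attributes to return) can be illustrated with an in-memory Python sketch.
It is not an LDAP implementation. The search function, the printer
objects 'p1' to 'p3', the 'Location' attribute name and the room values
are assumptions for illustration; only the 'uk.ibm.hursley.printers' name
and the 'Paper=A3' assertion come from the example above.

```python
def search(directory, base, depth, assertions, wanted):
    """Return the wanted attributes of objects at the given depth under base.

    directory maps dotted object names to attribute dictionaries;
    depth is 0 (base only), 1 (immediate children only) or "n" (whole subtree).
    """
    if depth not in (0, 1, "n"):
        raise ValueError("depth must be 0, 1 or 'n'")
    base_levels = len(base.split("."))
    results = {}
    for name, attrs in directory.items():
        if name != base and not name.startswith(base + "."):
            continue                                   # outside the subtree of base
        level = len(name.split(".")) - base_levels     # 0 = base, 1 = child, ...
        if depth != "n" and level != depth:
            continue
        if all(attrs.get(k) == v for k, v in assertions.items()):
            results[name] = {k: attrs[k] for k in wanted if k in attrs}
    return results


# Hypothetical directory contents.
directory = {
    "uk.ibm.hursley.printers": {},
    "uk.ibm.hursley.printers.p1": {"Paper": "A3", "Location": "Room 101"},
    "uk.ibm.hursley.printers.p2": {"Paper": "A4", "Location": "Room 102"},
    "uk.ibm.hursley.printers.p3": {"Paper": "A3", "Location": "Room 203"},
}

# Depth 1, assertion Paper=A3, return the locations of the matching printers.
print(search(directory, "uk.ibm.hursley.printers", 1, {"Paper": "A3"}, ["Location"]))
```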



LDAP, originally targeted at simple management applications and
Browser applications that provide simple read/write interactive access to
the X.500 Directory, has now been widely adopted by the Internet
community and is being proposed as a standard through the procedures of
the Internet Engineering Task Force (IETF). In accordance with the prior
art, directories have been used within the Internet environment for the
storage of data such as organisational information, telephone data (e.g.
telephone white and yellow pages), and resource descriptions such as the
location, capabilities and access parameters for printers and other
devices.

The present invention is implemented in a network (for example, in
the Internet environment or in an Intranet) including a plurality of
client computer systems, many of which are running Web Browser programs,
and a plurality of server computer systems which have web pages stored
thereon, the Web pages being identifiable by specific URLs. A directory
140 is accessible through the LDAP protocol.

Web pages held in storage at the server systems contain the usual
embedded links including hypertext references to URLs (href=http://...).
According to the invention, certain of the embedded links are specified
as directory-reference URLs. That is, these embedded links include
attribute contents of named LDAP directory objects. Such objects will be
referred to as Page Pointer Objects (PPOs) hereafter. When a user selects
such an embedded link from a web page, a message is sent to the named
directory object.

The LDAP directory contains one or more PPOs together with its more 
conventional contents. A PPO is an LDAP object which, according to the 
preferred embodiment, has the following characteristics:

o It is identified by a Distinguished Name, following normal
LDAP conventions.

o It contains one or more attribute-value pairs, again
following normal LDAP conventions. However, a number of the
attributes have special significance for the present
invention:

a ContentURL contains an unordered list of URLs for a desired
Web page. Each URL supplies an alternate address for the same
page - multiple addresses being provided to enable high
availability and support for load balancing.

a FallbackURL is similar to the ContentURL, except that the
addresses point to an alternate page content which should be
accessed if the desired page content cannot be reached through any
of its URLs.

a FailureURL is similar to a FallbackURL except that the addresses
point to a generic failure page content, accessed only if both the
desired and fallback pages are unreachable.

o It has a unique object class to identify the nature of the 
object. 
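
The intended order of use of the three PPO attributes can be sketched in
Python as follows. This is a minimal illustration under assumptions: the
PPO is modelled as a dictionary of URL lists, reachability is tested by a
caller-supplied function (is_reachable, hypothetical), and random
shuffling is used here as one simple way of spreading load over an
unordered list; the specification does not prescribe a particular
selection policy.

```python
import random


def select_url(ppo, is_reachable, rng=None):
    """Pick a URL from a PPO: desired content first, then fallback, then failure."""
    rng = rng or random.Random()
    for attribute in ("ContentURL", "FallbackURL", "FailureURL"):
        candidates = list(ppo.get(attribute, []))
        rng.shuffle(candidates)          # unordered list: vary which address is tried first
        for url in candidates:
            if is_reachable(url):
                return attribute, url
    return None, None                    # nothing reachable at all


# Hypothetical PPO contents.
ppo = {
    "ContentURL": ["http://www1.example/page.htm", "http://www2.example/page.htm"],
    "FallbackURL": ["http://www3.example/alternative.htm"],
    "FailureURL": ["http://www4.example/sorry.htm"],
}

# Simulate both content servers being unreachable.
reachable = {"http://www3.example/alternative.htm", "http://www4.example/sorry.htm"}
print(select_url(ppo, lambda url: url in reachable))
```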

Two alternative embodiments of the invention, which are not 
mutually exclusive, will now be described in more detail. A 'client-side' 
implementation using a conventional Web server and resolving the 
directory-reference URLs through client activity will be described with
reference to Figures 2 and 3. A 'server-side' implementation using a
conventional Web client and resolving the directory-reference URLs
through server activity will be described with reference to Figures 4 and 
5. 

Referring to Figure 2, a user working at client system 110
typically retrieves a bookmark of interest from local storage and a
Browser at the client system makes an http request to a Web server 120
for the page represented by the URL included in the request. The server
120 retrieves the page from local storage 130 and responds to the client
with the page content. The Web Browser controls displaying of the
retrieved pages at the client system. The page will typically contain
hyperlinks with embedded URLs. Certain pages contain one or more
directory-reference URLs in the form:

ldap://srvhost[:port]/dn[?ContentURL,FallbackURL,FailureURL[?base[?oc=PPO]]]

The syntax of the above uses square brackets [] to denote optional
elements. 'srvhost' identifies the host to be contacted; 'port'
identifies the port to be used; 'dn' is the Distinguished Name of the
PPO; 'base' specifies that only the base level object is to be retrieved;
'oc=PPO' specifies that only objects of object class PPO are to be
retrieved. Where named attributes are specified, this reduces line
traffic by eliminating non-essential object data and has the added
advantage of allowing the PPO to have other attributes without
interfering with the indirect referencing operation. The base and object
class definitions ensure that only the desired PPO object is returned,
even in those cases when non-standard defaults have been set and/or the
directory contents have been tampered with such that they no longer
contain PPOs.
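
The structure of a directory-reference URL described above can be illustrated with a small parser. The following Python sketch is offered under stated assumptions only: the function and type names are hypothetical, the example host and Distinguished Name are invented, and the filter is written as 'oc=PPO' on the assumption that this is the intended form of the object class restriction.

```python
from typing import List, NamedTuple, Optional
from urllib.parse import unquote


class DirectoryReference(NamedTuple):
    """Hypothetical container for the parts of a directory-reference URL."""
    host: str
    port: Optional[int]
    dn: str
    attributes: List[str]
    scope: Optional[str]
    search_filter: Optional[str]


def parse_directory_reference(url: str) -> DirectoryReference:
    """Split ldap://srvhost[:port]/dn[?attrs[?scope[?filter]]] into its parts."""
    prefix = "ldap://"
    if not url.lower().startswith(prefix):
        raise ValueError("not a directory-reference URL")
    host_port, _, tail = url[len(prefix):].partition("/")
    host, _, port_text = host_port.partition(":")
    # The tail is dn?attributes?scope?filter; trailing parts are optional.
    parts = tail.split("?", 3)
    parts += [""] * (4 - len(parts))
    dn, attrs, scope, search_filter = parts
    return DirectoryReference(
        host=host,
        port=int(port_text) if port_text else None,
        dn=unquote(dn),
        attributes=[a.strip() for a in attrs.split(",") if a.strip()],
        scope=scope or None,
        search_filter=unquote(search_filter) or None,
    )


ref = parse_directory_reference(
    "ldap://dir.example.com:389/cn=ProductManual,o=Example"
    "?ContentURL,FallbackURL,FailureURL?base?oc=PPO"
)
```

Here `ref.attributes` holds the three named attributes, so a client using such a parser could ask the directory for those attributes alone rather than the whole object.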

Referring to Figure 3, if the user at the client system then
cursor-clicks 180 on a directory-reference hyperlink of interest, the
embedded directory-reference URL is retrieved. This URL is identified as
a directory-reference URL and the client system performs 190 a
determination of whether logic for performing a directory access function
is available at the client system. If not, then the client system requests
200 downloading of a specific applet (which may, for example, be written
in the Java programming language). The downloaded applet is then executed
at the client system, sending 210 a request to the directory which
specifies the directory-reference URL for a particular directory object
(PPO) including its Distinguished Name. Such directory search requests
are asynchronous and so multiple requests can be issued and the results
independently processed as they are received. The directory returns 220
the requested PPO to the applet running at the client system.

The applet then processes 230 the PPO. Typically, a single URL is
chosen at random from the ContentURL values and returned 240 to the Web
Browser. The Web Browser then uses 250 this URL in an attempt to access
the relevant Web page. If successful, then the directory access applet
exits 270. If unsuccessful, other ContentURL values are chosen 230 for
attempting access to the same page. If still unsuccessful, the FallbackURL
values are chosen. This provision of alternative Web page URLs achieves a
form of simple load balancing when a number of Web pages having the same
or related content are available at different servers. If still
unsuccessful, then the FailureURL value is chosen. The contents of the
Web page accessed by the FailureURL are specific to a particular PPO or
set of PPOs and so the information displayed to the user at the client
system will be more useful than a totally generic failure message such as
is displayed in prior art systems. Each of the values ContentURL,
FallbackURL and FailureURL may address the original Web server computer
or any other. According to this embodiment of the invention, multiple
URLs are typically returned to the applet at the client system and it is
the applet which chooses which URL or URLs to use in the subsequent
client requests to Web servers.
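
The client-side selection order described above (random ContentURL values first, then FallbackURL values, then the FailureURL) might be sketched as follows. This Python fragment is illustrative only and is not the applet of the embodiment; the function name, the dictionary representation of the PPO and the `try_access` callback (standing in for the Web Browser's access attempt) are all assumptions.

```python
import random
from typing import Callable, Dict, List, Optional


def choose_reachable_url(
    ppo: Dict[str, List[str]],
    try_access: Callable[[str], bool],
    rng: Optional[random.Random] = None,
) -> Optional[str]:
    """Return the first URL that can be accessed, or None if none can.

    Attributes are tried in the order ContentURL, FallbackURL, FailureURL.
    Within an attribute the values are tried in random order, which spreads
    requests across servers holding the same page.
    """
    rng = rng or random.Random()
    for attribute in ("ContentURL", "FallbackURL", "FailureURL"):
        candidates = list(ppo.get(attribute, []))
        rng.shuffle(candidates)
        for url in candidates:
            if try_access(url):
                return url
    return None


# Hypothetical PPO contents and a stand-in for real page access:
ppo = {
    "ContentURL": ["http://www1.example.com/p.html",
                   "http://www2.example.com/p.html"],
    "FallbackURL": ["http://www.example.com/alt.html"],
    "FailureURL": ["http://www.example.com/fail.html"],
}
reachable = {"http://www.example.com/alt.html",
             "http://www.example.com/fail.html"}
chosen = choose_reachable_url(ppo, lambda url: url in reachable)
```

In this example both ContentURL addresses are treated as unreachable, so the fallback page is selected before the failure page is ever considered.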

A server-side embodiment of the invention will now be described
with reference to Figures 4 and 5. According to this embodiment, a Web
server 120 as shown in Figure 4 performs the processing required to
identify LDAP URLs within Web page hyperlinks, to resolve them via a
directory 140, and to replace each original link with one based on
relevant object data returned from the directory.

The invention is implemented following a user at a client computer
inputting 300 a URL, or retrieving from system memory of the client
computer a bookmark of interest which includes a URL, which invokes the
Web Browser operation of making a request 310 to a Web server for display
of the page represented by the URL (e.g. http://...).

The server computer identified in the request accesses 320 the
relevant store, which store may be held at an auxiliary disk storage
device connected to the server computer, and retrieves the required Web
page. As is known in the art, Web pages often include hyperlinks
comprising embedded URLs. According to the present invention, one or more
of the URLs embedded in certain of the Web pages are directory-reference
URLs in the form:

ldap://srvhost[:port]/dn[?ContentURL,FallbackURL,FailureURL[?base[?oc=PPO]]]

The server scans 330 the retrieved Web page for such
directory-reference URLs, which exist in a Web page within an HTML tag
(href=...). Upon identification of a directory-reference URL, the server
contacts 340 the directory by sending an asynchronous request to retrieve
the specified PPO. The directory returns 350 the requested PPO.

The server then processes 360 the returned PPO, choosing a URL
(firstly a ContentURL attribute value; if that attribute cannot be
found, then a value from the FallbackURL attribute is used; and
failing that, a value from the FailureURL attribute). The single
selected URL is then returned 370 to the Web client system together with
the retrieved and processed Web page.
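
The server-side scan-and-replace steps 330 to 370 might be sketched as follows. This Python fragment is a simplified illustration under stated assumptions: it recognises only double-quoted href values, it takes the first value of the chosen attribute (the text does not say which value is taken), and `lookup_ppo` is a hypothetical stand-in for the asynchronous directory request.

```python
import re
from typing import Callable, Dict, List, Optional

# Matches href="ldap://..." and captures the prefix, the URL and the quote.
LDAP_HREF = re.compile(r'(href\s*=\s*")(ldap://[^"]+)(")', re.IGNORECASE)

Ppo = Dict[str, List[str]]


def select_url(ppo: Ppo) -> Optional[str]:
    """Pick a ContentURL value, else a FallbackURL value, else a FailureURL value."""
    for attribute in ("ContentURL", "FallbackURL", "FailureURL"):
        values = ppo.get(attribute)
        if values:
            return values[0]  # assumption: the first value is used
    return None


def rewrite_page(html: str, lookup_ppo: Callable[[str], Optional[Ppo]]) -> str:
    """Replace each directory-reference link with a URL taken from its PPO."""
    def replace(match: "re.Match[str]") -> str:
        ppo = lookup_ppo(match.group(2))
        url = select_url(ppo) if ppo else None
        if url is None:
            return match.group(0)  # leave the original link untouched
        return match.group(1) + url + match.group(3)

    return LDAP_HREF.sub(replace, html)


# Hypothetical directory contents keyed by directory-reference URL:
directory = {
    "ldap://dir.example.com/cn=Manual,o=Example": {
        "FallbackURL": ["http://www.example.com/alt.html"],
    },
}
page = ('<a href="ldap://dir.example.com/cn=Manual,o=Example">Manual</a> '
        '<a href="http://www.example.com/">Home</a>')
processed = rewrite_page(page, directory.get)
```

In this example the PPO has no ContentURL attribute, so the link is rewritten to the FallbackURL value while the ordinary http link is left unchanged.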

The multi-fallback semantics provided for server-side
implementations of the invention are generally not as rich as where the
Web Browser does the processing, and this is a disadvantage of
server-side implementations relative to client-side implementations of
the invention. The server according to this preferred embodiment does not
access the referenced page on behalf of the client, but returns the URL
to the client so that it can initiate the access. This ensures that the
protocol requested by the client is consistent with the protocol returned
to the client. Such a server-side approach may co-exist in a network with
a client-side approach.

It is within the scope of the present invention to make use of 
index terms which are additional to the Page Pointer Object references 
which are provided within Web pages. The PPO references form index terms 
to one or more Web pages through a directory. A server computer to which 
a request for web page access is sent necessarily receives and is 
responsive to certain information about the client system: the client's
Internet Protocol address (unless the access request went through a
firewall, in which case it is the IP address of the firewall which the
server receives), the language of the client system, its operating
system, etc. According to one embodiment of the present invention, PPO
references may be supplemented with other search terms in requests sent 
from a client system and a receiving server is responsive to such search 
terms and/or the received client system information such that the Web 
pages retrieved according to the present invention vary with the nature 
of the client system or with search terms provided by the client system. 

For example, a server system may hold Web pages including a
company's software product manuals together with a list of addresses of
customers' computers on which licensed copies of that software are
installed. The server may be adapted to permit access to the product
manuals only if a received request includes a client address which
appears in the list, and to provide access to an alternative Web page
containing product ordering information when the client address is not
recognised.
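
The licensed-customer example above amounts to a membership test on the client address. The following Python sketch shows one way this could look; the function name, addresses and URLs are hypothetical, and the address seen by the server may be that of a firewall rather than of the client itself.

```python
from typing import Set


def select_page_for_client(
    client_address: str,
    licensed_addresses: Set[str],
    manual_url: str,
    ordering_url: str,
) -> str:
    """Return the manuals page for listed clients, else the ordering page."""
    if client_address in licensed_addresses:
        return manual_url
    return ordering_url


# Hypothetical list of licensed customer addresses and page URLs:
licensed = {"192.0.2.10", "192.0.2.11"}
manual = "http://www.example.com/manuals/index.html"
ordering = "http://www.example.com/ordering.html"

page_for_customer = select_page_for_client("192.0.2.10", licensed, manual, ordering)
page_for_stranger = select_page_for_client("198.51.100.7", licensed, manual, ordering)
```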

Additionally, having stored Web page URLs within a directory in 
accordance with the invention, the Web page URLs can be associated with 
other indexing terms suitable for administration. Web page management is 
then simplified since the Web pages can be accessed using many different 
search criteria such as date of creation, language, or content.

CLAIMS

1. An access mechanism for accessing material via the World Wide Web,
wherein Web pages include hyperlinks for accessing associated Web pages
stored at Web server computers, characterised in that at least some of
said hyperlinks comprise links to one or more directories storing URLs
for accessing particular Web pages, the mechanism including:

access logic, responsive to user selection at a Web client system
of a hyperlink comprising a link to one of said directories, for
retrieving from said directory a particular one or plurality of said
stored URLs and accessing at least one of said particular Web pages using
said retrieved URL or URLs.

2. An access mechanism according to claim 1, wherein the access logic
for retrieving stored URLs and accessing Web pages comprises an applet
for execution on the Web client system, the applet being responsive to
user selection at the client system of a hyperlink comprising a link to a
directory on a directory server computer to send a request to said
directory server computer for retrieval of said one or more URLs, and
being responsive to receipt from the directory server computer of said
retrieved URLs to pass at least one of said URLs to a Web Browser on the
Web client system and to invoke the Web Browser to access at least one of
said particular Web pages using said URL.

3. An access mechanism according to claim 1 or claim 2, wherein the
access logic comprises an integral feature of a Web Browser computer
program.

4. An access mechanism according to claim 1, wherein the access logic
is responsive to selection of a hyperlink comprising a link to a
directory, which hyperlink includes a directory-reference URL specifying
a named directory object, to request the named directory object or
related directory objects from a directory in which stored directory
objects have particular URLs as attribute values, and on receipt of a
directory object having one or more URLs as attribute values to process
the directory object to obtain a URL.



5. An access mechanism according to claim 4, wherein directory objects
include FallbackURLs identifying alternate web pages which are obtained 
for use in a client request when a first Web page identified by a first 
URL is unavailable. 

6. An access mechanism according to claim 1 or claim 5, wherein
directory objects include FailureURLs identifying Web pages having
content information relating to Web page access failure which failure
information is relevant to the received directory object.

7. An access mechanism according to claim 1, wherein the access logic 
is responsive to selection of a hyperlink comprising a link to an LDAP 
directory and specifying a directory object Distinguished Name, and
optionally specifying additional directory search criteria, to request
one or more directory objects from the LDAP directory and, on receipt of
a directory object having one or more web page URLs as attribute values, 
to process the directory object to obtain a URL. 

8. An access mechanism according to any preceding claim, wherein the 
retrieval of URLs from the directory or provision of retrieved URLs to a 
client system is conditional on the identity of the client system or of 
an end user at the client system as specified in the client request. 

9. An access mechanism according to any preceding claim, wherein the
access logic is responsive to index data stored in a directory together 
with Web page URLs for invoking directory access search operations using 
said index data. 

10. A data processing system including access logic for use in
accessing material via the World Wide Web, wherein Web pages include
hyperlinks for accessing associated Web pages stored at Web server
computers; characterised in that

at least some of said hyperlinks comprise links to one or more
directories storing URLs for accessing particular Web pages, and

the access logic is responsive to user selection at a Web client 
system of a hyperlink comprising a link to one of said directories, for 
retrieving from said directory a particular one or plurality of said 
stored URLs and accessing at least one of said particular web pages using 
said retrieved URL or URLs.

11. A method of accessing Web pages, implemented by a Web client system
which is adapted to access one or more directories storing directory
objects whose attributes include Web page URLs, wherein Web pages include
hyperlinks for accessing associated Web pages stored at Web server
computers and at least some of said hyperlinks comprise links to said one
or more directories, the method comprising:

responsive to user selection at a web client system of a hyperlink 
comprising a link to one of said directories and specifying a particular 
directory object, accessing said directory to retrieve the particular 
directory object; 

obtaining a Web page URL from the directory object; and 

accessing the Web page using the obtained URL. 

12. A method, implemented in a Web server computer which is adapted to
access one or more directories storing directory objects whose attributes
include Web page URLs, of processing Web pages retrieved by a Web server
computer in response to a request from a Web client computer, the method
comprising:

accessing a storage means in response to a request from a Web
client computer and retrieving a requested Web page;

scanning the retrieved page for directory-reference URLs specifying
particular directory objects;

on detection of a directory-reference URL, requesting the specified
directory object or objects from the referenced directory;

on receipt of a requested directory object, processing the 
directory object to obtain a Web page URL and incorporating the Web page 
URL within the retrieved Web page; 

returning the processed Web page to the Web client computer. 

13. A data processing system comprising:

at least one client computer having a Web Browser installed 
thereon;

at least one server computer storing Web pages identified by Web
page URLs, certain of said Web pages including hyperlinks for accessing
associated Web pages; and

a directory comprising at least one server computer storing a
directory database in which directory objects have Web page URLs as
attribute-values;



wherein at least some of said hyperlinks comprise links to the
directory specifying particular directory objects and wherein the system
includes access logic, responsive to user selection at a client computer
of a hyperlink comprising a link to the directory, for retrieving from
the directory a particular directory object, processing the directory
object to obtain a Web page URL, and accessing the Web page identified by
the URL.


14. A data processing system according to claim 13, wherein the access
logic comprises an applet stored at a server computer and available for
transfer to a client computer in response to a request from the client
system for execution at the client computer.

15. A method of providing an access mechanism for accessing material
via the World Wide Web Internet service, wherein Web pages include
hyperlinks for accessing material stored in associated Web pages, the
method including the steps of:



creating one or more directory objects within a directory, the
directory objects having attribute-values including URLs for accessing
particular Web pages;


establishing hyperlinks within Web pages comprising references to 
said directory which references identify particular ones of said 
directory objects; and 

providing access logic for retrieving one or more of said URLs from
said identified directory objects in response to user selection at a Web
client system of a directory-reference hyperlink and for accessing at
least one of said particular Web pages using said retrieved URL or URLs.